UT Dialogue System at NTCIR-12 STC

نویسندگان

Shoetsu Sato

Shonosuke Ishiwatari

Naoki Yoshinaga

Masashi Toyoda

Masaru Kitsuregawa

چکیده

This paper reports a dialogue system developed at the University of Tokyo for participation in NTCIR-12 on the short text conversation (STC) pilot task. We participated in the Japanese STC task on Twitter and built a system that selects plausible responses for an input post (tweet) from a given pool of tweets. Our system first selects a (small) set of tweets as response candidates from the pool of tweets by exploiting a kernel-based classifier. The classifier uses bagof-words in an utterance and a response (candidate) as features. We then perform re-ranking of the chosen candidates in accordance with the perplexity given by Long Short-Term Memory-based Recurrent Neural Network (lstm-rnn) to return a ranked list of plausible responses. In order to capture the diversity of domains (topics, wordings, writing styles, etc.) in chat dialogue, we train multiple lstm-rnns from subsets of utterance-response pairs that are obtained by clustering of distributed representations of the utterances, and use the lstm-rnn that is trained from the utteranceresponse cluster whose centroid is the closest to the input tweet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

YUILA at the NTCIR-12 Short Text Challenge: Combining Twitter Data with Dialogue System Logs

The YUILA team participated in the Japanese subtask of the NTCIR-12 Short Text Challenge task. This report describes our approach to solving the responsiveness problem in STC task by using external dialogue log corpus and discusses the official results.

متن کامل

Utterance Selection Based on Sentence Similarities and Dialogue Breakdown Detection on NTCIR-12 STC Task

This paper describes our contribution for the NTCIR-12 STC Japanese task. The purpose of the task is to retrieve tweets that suits as responses of a chat-oriented dialogue system from a huge number of tweets pool. Our system retrieves tweets based on following two steps: first it retrieves tweets that resemble to input sentences, and then, it filters inappropriate tweets in terms of the dialogu...

متن کامل

Microsoft Research Asia at NTCIR-12 STC Task

This paper describes our approaches at NTCIR-12 short text conversation (STC) task (Chinese). For a new post, instead of considering post-comment similarity, our system focus on finding similar posts in the repository and retrieve their corresponding comments. Meanwhile, we choose frequency property of comments to adjust ranking models. Our best run achieves 0.4854 for mean P, 0.3367 for mean n...

متن کامل

SLSTC at the NTCIR-12 STC Task

The SLSTC team participated in the NTCIR-12 Short Text Conversation (STC)[1] task. This report describes our approach to solving the STC problem and discusses the ocial results.

متن کامل

BUPTTeam Participation in NTCIR-12 Short Text Conversation Task

Abstract This paper provides an overview of BUPTTeam’s system participated in the Short Text Conversation (STC) task of Chinese at NTCIR-12. STC is a new NTCIR challenging task which is defined as an IR problem, i.e., retrieval based a repository of postcomment pairs from Sina Weibo. In this paper, we propose a novel method to retrieve post result from the repository based on the following four...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

UT Dialogue System at NTCIR-12 STC

نویسندگان

چکیده

منابع مشابه

YUILA at the NTCIR-12 Short Text Challenge: Combining Twitter Data with Dialogue System Logs

Utterance Selection Based on Sentence Similarities and Dialogue Breakdown Detection on NTCIR-12 STC Task

Microsoft Research Asia at NTCIR-12 STC Task

SLSTC at the NTCIR-12 STC Task

BUPTTeam Participation in NTCIR-12 Short Text Conversation Task

عنوان ژورنال:

اشتراک گذاری